Your browser doesn't support javascript.
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
J Am Med Inform Assoc ; 30(7): 1305-1312, 2023 06 20.
Artigo em Inglês | MEDLINE | ID: covidwho-2325541

RESUMO

Machine learning (ML)-driven computable phenotypes are among the most challenging to share and reproduce. Despite this difficulty, the urgent public health considerations around Long COVID make it especially important to ensure the rigor and reproducibility of Long COVID phenotyping algorithms such that they can be made available to a broad audience of researchers. As part of the NIH Researching COVID to Enhance Recovery (RECOVER) Initiative, researchers with the National COVID Cohort Collaborative (N3C) devised and trained an ML-based phenotype to identify patients highly probable to have Long COVID. Supported by RECOVER, N3C and NIH's All of Us study partnered to reproduce the output of N3C's trained model in the All of Us data enclave, demonstrating model extensibility in multiple environments. This case study in ML-based phenotype reuse illustrates how open-source software best practices and cross-site collaboration can de-black-box phenotyping algorithms, prevent unnecessary rework, and promote open science in informatics.


Assuntos
Boxe , COVID-19 , Saúde da População , Humanos , Registros Eletrônicos de Saúde , Síndrome de COVID-19 Pós-Aguda , Reprodutibilidade dos Testes , Aprendizado de Máquina , Fenótipo
2.
Nat Commun ; 14(1): 2914, 2023 05 22.
Artigo em Inglês | MEDLINE | ID: covidwho-2322120

RESUMO

Long COVID, or complications arising from COVID-19 weeks after infection, has become a central concern for public health experts. The United States National Institutes of Health founded the RECOVER initiative to better understand long COVID. We used electronic health records available through the National COVID Cohort Collaborative to characterize the association between SARS-CoV-2 vaccination and long COVID diagnosis. Among patients with a COVID-19 infection between August 1, 2021 and January 31, 2022, we defined two cohorts using distinct definitions of long COVID-a clinical diagnosis (n = 47,404) or a previously described computational phenotype (n = 198,514)-to compare unvaccinated individuals to those with a complete vaccine series prior to infection. Evidence of long COVID was monitored through June or July of 2022, depending on patients' data availability. We found that vaccination was consistently associated with lower odds and rates of long COVID clinical diagnosis and high-confidence computationally derived diagnosis after adjusting for sex, demographics, and medical history.


Assuntos
COVID-19 , Síndrome de COVID-19 Pós-Aguda , Estados Unidos/epidemiologia , Humanos , COVID-19/epidemiologia , COVID-19/prevenção & controle , Vacinas contra COVID-19 , Estudos de Coortes , SARS-CoV-2 , Vacinação
3.
Clin J Am Soc Nephrol ; 2023 May 03.
Artigo em Inglês | MEDLINE | ID: covidwho-2320701

RESUMO

BACKGROUND: AKI is associated with mortality in patients hospitalized with coronavirus disease 2019 (COVID-19); however, its incidence, geographic distribution, and temporal trends since the start of the pandemic are understudied. METHODS: Electronic health record data were obtained from 53 health systems in the United States in the National COVID Cohort Collaborative. We selected hospitalized adults diagnosed with COVID-19 between March 6, 2020, and January 6, 2022. AKI was determined with serum creatinine and diagnosis codes. Time was divided into 16-week periods (P1-6) and geographical regions into Northeast, Midwest, South, and West. Multivariable models were used to analyze the risk factors for AKI or mortality. RESULTS: Of a total cohort of 336,473, 129,176 (38%) patients had AKI. Fifty-six thousand three hundred and twenty-two (17%) lacked a diagnosis code but had AKI based on the change in serum creatinine. Similar to patients coded for AKI, these patients had higher mortality compared with those without AKI. The incidence of AKI was highest in P1 (47%; 23,097/48,947), lower in P2 (37%; 12,102/32,513), and relatively stable thereafter. Compared with the Midwest, the Northeast, South, and West had higher adjusted odds of AKI in P1. Subsequently, the South and West regions continued to have the highest relative AKI odds. In multivariable models, AKI defined by either serum creatinine or diagnostic code and the severity of AKI was associated with mortality. CONCLUSIONS: The incidence and distribution of COVID-19-associated AKI changed since the first wave of the pandemic in the United States.

4.
Sleep ; 2023 May 11.
Artigo em Inglês | MEDLINE | ID: covidwho-2316915

RESUMO

STUDY OBJECTIVES: Obstructive sleep apnea (OSA) has been associated with more severe acute coronavirus disease-2019 (COVID-19) outcomes. We assessed OSA as a potential risk factor for Post-Acute Sequelae of SARS-CoV-2 (PASC). METHODS: We assessed the impact of preexisting OSA on the risk for probable PASC in adults and children using electronic health record data from multiple research networks. Three research networks within the REsearching COVID to Enhance Recovery initiative (PCORnet Adult, PCORnet Pediatric, and the National COVID Cohort Collaborative [N3C]) employed a harmonized analytic approach to examine the risk of probable PASC in COVID-19-positive patients with and without a diagnosis of OSA prior to pandemic onset. Unadjusted odds ratios (ORs) were calculated as well as ORs adjusted for age group, sex, race/ethnicity, hospitalization status, obesity, and preexisting comorbidities. RESULTS: Across networks, the unadjusted OR for probable PASC associated with a preexisting OSA diagnosis in adults and children ranged from 1.41 to 3.93. Adjusted analyses found an attenuated association that remained significant among adults only. Multiple sensitivity analyses with expanded inclusion criteria and covariates yielded results consistent with the primary analysis. CONCLUSIONS: Adults with preexisting OSA were found to have significantly elevated odds of probable PASC. This finding was consistent across data sources, approaches for identifying COVID-19-positive patients, and definitions of PASC. Patients with OSA may be at elevated risk for PASC after SARS-CoV-2 infection and should be monitored for post-acute sequelae.

5.
J Biomed Inform ; 139: 104295, 2023 03.
Artigo em Inglês | MEDLINE | ID: covidwho-2210676

RESUMO

Healthcare datasets obtained from Electronic Health Records have proven to be extremely useful for assessing associations between patients' predictors and outcomes of interest. However, these datasets often suffer from missing values in a high proportion of cases, whose removal may introduce severe bias. Several multiple imputation algorithms have been proposed to attempt to recover the missing information under an assumed missingness mechanism. Each algorithm presents strengths and weaknesses, and there is currently no consensus on which multiple imputation algorithm works best in a given scenario. Furthermore, the selection of each algorithm's parameters and data-related modeling choices are also both crucial and challenging. In this paper we propose a novel framework to numerically evaluate strategies for handling missing data in the context of statistical analysis, with a particular focus on multiple imputation techniques. We demonstrate the feasibility of our approach on a large cohort of type-2 diabetes patients provided by the National COVID Cohort Collaborative (N3C) Enclave, where we explored the influence of various patient characteristics on outcomes related to COVID-19. Our analysis included classic multiple imputation techniques as well as simple complete-case Inverse Probability Weighted models. Extensive experiments show that our approach can effectively highlight the most promising and performant missing-data handling strategy for our case study. Moreover, our methodology allowed a better understanding of the behavior of the different models and of how it changed as we modified their parameters. Our method is general and can be applied to different research fields and on datasets containing heterogeneous types.


Assuntos
COVID-19 , Humanos , Algoritmos , Projetos de Pesquisa , Viés , Probabilidade
6.
J Am Med Inform Assoc ; 29(7): 1172-1182, 2022 06 14.
Artigo em Inglês | MEDLINE | ID: covidwho-1795238

RESUMO

OBJECTIVE: The goals of this study were to harmonize data from electronic health records (EHRs) into common units, and impute units that were missing. MATERIALS AND METHODS: The National COVID Cohort Collaborative (N3C) table of laboratory measurement data-over 3.1 billion patient records and over 19 000 unique measurement concepts in the Observational Medical Outcomes Partnership (OMOP) common-data-model format from 55 data partners. We grouped ontologically similar OMOP concepts together for 52 variables relevant to COVID-19 research, and developed a unit-harmonization pipeline comprised of (1) selecting a canonical unit for each measurement variable, (2) arriving at a formula for conversion, (3) obtaining clinical review of each formula, (4) applying the formula to convert data values in each unit into the target canonical unit, and (5) removing any harmonized value that fell outside of accepted value ranges for the variable. For data with missing units for all the results within a lab test for a data partner, we compared values with pooled values of all data partners, using the Kolmogorov-Smirnov test. RESULTS: Of the concepts without missing values, we harmonized 88.1% of the values, and imputed units for 78.2% of records where units were absent (41% of contributors' records lacked units). DISCUSSION: The harmonization and inference methods developed herein can serve as a resource for initiatives aiming to extract insight from heterogeneous EHR collections. Unique properties of centralized data are harnessed to enable unit inference. CONCLUSION: The pipeline we developed for the pooled N3C data enables use of measurements that would otherwise be unavailable for analysis.


Assuntos
COVID-19 , Registros Eletrônicos de Saúde , Estudos de Coortes , Coleta de Dados , Humanos
7.
EBioMedicine ; 74: 103722, 2021 Dec.
Artigo em Inglês | MEDLINE | ID: covidwho-1536517

RESUMO

BACKGROUND: Numerous publications describe the clinical manifestations of post-acute sequelae of SARS-CoV-2 (PASC or "long COVID"), but they are difficult to integrate because of heterogeneous methods and the lack of a standard for denoting the many phenotypic manifestations. Patient-led studies are of particular importance for understanding the natural history of COVID-19, but integration is hampered because they often use different terms to describe the same symptom or condition. This significant disparity in patient versus clinical characterization motivated the proposed ontological approach to specifying manifestations, which will improve capture and integration of future long COVID studies. METHODS: The Human Phenotype Ontology (HPO) is a widely used standard for exchange and analysis of phenotypic abnormalities in human disease but has not yet been applied to the analysis of COVID-19. FUNDING: We identified 303 articles published before April 29, 2021, curated 59 relevant manuscripts that described clinical manifestations in 81 cohorts three weeks or more following acute COVID-19, and mapped 287 unique clinical findings to HPO terms. We present layperson synonyms and definitions that can be used to link patient self-report questionnaires to standard medical terminology. Long COVID clinical manifestations are not assessed consistently across studies, and most manifestations have been reported with a wide range of synonyms by different authors. Across at least 10 cohorts, authors reported 31 unique clinical features corresponding to HPO terms; the most commonly reported feature was Fatigue (median 45.1%) and the least commonly reported was Nausea (median 3.9%), but the reported percentages varied widely between studies. INTERPRETATION: Translating long COVID manifestations into computable HPO terms will improve analysis, data capture, and classification of long COVID patients. If researchers, clinicians, and patients share a common language, then studies can be compared/pooled more effectively. Furthermore, mapping lay terminology to HPO will help patients assist clinicians and researchers in creating phenotypic characterizations that are computationally accessible, thereby improving the stratification, diagnosis, and treatment of long COVID. FUNDING: U24TR002306; UL1TR001439; P30AG024832; GBMF4552; R01HG010067; UL1TR002535; K23HL128909; UL1TR002389; K99GM145411.


Assuntos
COVID-19/complicações , COVID-19/patologia , COVID-19/diagnóstico , Humanos , SARS-CoV-2 , Síndrome de COVID-19 Pós-Aguda
8.
J Am Med Inform Assoc ; 29(4): 609-618, 2022 03 15.
Artigo em Inglês | MEDLINE | ID: covidwho-1443051

RESUMO

OBJECTIVE: In response to COVID-19, the informatics community united to aggregate as much clinical data as possible to characterize this new disease and reduce its impact through collaborative analytics. The National COVID Cohort Collaborative (N3C) is now the largest publicly available HIPAA limited dataset in US history with over 6.4 million patients and is a testament to a partnership of over 100 organizations. MATERIALS AND METHODS: We developed a pipeline for ingesting, harmonizing, and centralizing data from 56 contributing data partners using 4 federated Common Data Models. N3C data quality (DQ) review involves both automated and manual procedures. In the process, several DQ heuristics were discovered in our centralized context, both within the pipeline and during downstream project-based analysis. Feedback to the sites led to many local and centralized DQ improvements. RESULTS: Beyond well-recognized DQ findings, we discovered 15 heuristics relating to source Common Data Model conformance, demographics, COVID tests, conditions, encounters, measurements, observations, coding completeness, and fitness for use. Of 56 sites, 37 sites (66%) demonstrated issues through these heuristics. These 37 sites demonstrated improvement after receiving feedback. DISCUSSION: We encountered site-to-site differences in DQ which would have been challenging to discover using federated checks alone. We have demonstrated that centralized DQ benchmarking reveals unique opportunities for DQ improvement that will support improved research analytics locally and in aggregate. CONCLUSION: By combining rapid, continual assessment of DQ with a large volume of multisite data, it is possible to support more nuanced scientific questions with the scale and rigor that they require.


Assuntos
COVID-19 , Estudos de Coortes , Confiabilidade dos Dados , Health Insurance Portability and Accountability Act , Humanos , Estados Unidos
9.
Diabetes Care ; 44(7): 1564-1572, 2021 07.
Artigo em Inglês | MEDLINE | ID: covidwho-1405389

RESUMO

OBJECTIVE: To determine the respective associations of premorbid glucagon-like peptide-1 receptor agonist (GLP1-RA) and sodium-glucose cotransporter 2 inhibitor (SGLT2i) use, compared with premorbid dipeptidyl peptidase 4 inhibitor (DPP4i) use, with severity of outcomes in the setting of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. RESEARCH DESIGN AND METHODS: We analyzed observational data from SARS-CoV-2-positive adults in the National COVID Cohort Collaborative (N3C), a multicenter, longitudinal U.S. cohort (January 2018-February 2021), with a prescription for GLP1-RA, SGLT2i, or DPP4i within 24 months of positive SARS-CoV-2 PCR test. The primary outcome was 60-day mortality, measured from positive SARS-CoV-2 test date. Secondary outcomes were total mortality during the observation period and emergency room visits, hospitalization, and mechanical ventilation within 14 days. Associations were quantified with odds ratios (ORs) estimated with targeted maximum likelihood estimation using a super learner approach, accounting for baseline characteristics. RESULTS: The study included 12,446 individuals (53.4% female, 62.5% White, mean ± SD age 58.6 ± 13.1 years). The 60-day mortality was 3.11% (387 of 12,446), with 2.06% (138 of 6,692) for GLP1-RA use, 2.32% (85 of 3,665) for SGLT2i use, and 5.67% (199 of 3,511) for DPP4i use. Both GLP1-RA and SGLT2i use were associated with lower 60-day mortality compared with DPP4i use (OR 0.54 [95% CI 0.37-0.80] and 0.66 [0.50-0.86], respectively). Use of both medications was also associated with decreased total mortality, emergency room visits, and hospitalizations. CONCLUSIONS: Among SARS-CoV-2-positive adults, premorbid GLP1-RA and SGLT2i use, compared with DPP4i use, was associated with lower odds of mortality and other adverse outcomes, although DPP4i users were older and generally sicker.


Assuntos
COVID-19 , Diabetes Mellitus Tipo 2 , Receptor do Peptídeo Semelhante ao Glucagon 1/agonistas , Inibidores do Transportador 2 de Sódio-Glicose , Adulto , Idoso , COVID-19/diagnóstico , Diabetes Mellitus Tipo 2/tratamento farmacológico , Feminino , Humanos , Estudos Longitudinais , Masculino , Pessoa de Meia-Idade , Inibidores do Transportador 2 de Sódio-Glicose/uso terapêutico , Estados Unidos
10.
JAMA Netw Open ; 4(7): e2116901, 2021 07 01.
Artigo em Inglês | MEDLINE | ID: covidwho-1306627

RESUMO

Importance: The National COVID Cohort Collaborative (N3C) is a centralized, harmonized, high-granularity electronic health record repository that is the largest, most representative COVID-19 cohort to date. This multicenter data set can support robust evidence-based development of predictive and diagnostic tools and inform clinical care and policy. Objectives: To evaluate COVID-19 severity and risk factors over time and assess the use of machine learning to predict clinical severity. Design, Setting, and Participants: In a retrospective cohort study of 1 926 526 US adults with SARS-CoV-2 infection (polymerase chain reaction >99% or antigen <1%) and adult patients without SARS-CoV-2 infection who served as controls from 34 medical centers nationwide between January 1, 2020, and December 7, 2020, patients were stratified using a World Health Organization COVID-19 severity scale and demographic characteristics. Differences between groups over time were evaluated using multivariable logistic regression. Random forest and XGBoost models were used to predict severe clinical course (death, discharge to hospice, invasive ventilatory support, or extracorporeal membrane oxygenation). Main Outcomes and Measures: Patient demographic characteristics and COVID-19 severity using the World Health Organization COVID-19 severity scale and differences between groups over time using multivariable logistic regression. Results: The cohort included 174 568 adults who tested positive for SARS-CoV-2 (mean [SD] age, 44.4 [18.6] years; 53.2% female) and 1 133 848 adult controls who tested negative for SARS-CoV-2 (mean [SD] age, 49.5 [19.2] years; 57.1% female). Of the 174 568 adults with SARS-CoV-2, 32 472 (18.6%) were hospitalized, and 6565 (20.2%) of those had a severe clinical course (invasive ventilatory support, extracorporeal membrane oxygenation, death, or discharge to hospice). Of the hospitalized patients, mortality was 11.6% overall and decreased from 16.4% in March to April 2020 to 8.6% in September to October 2020 (P = .002 for monthly trend). Using 64 inputs available on the first hospital day, this study predicted a severe clinical course using random forest and XGBoost models (area under the receiver operating curve = 0.87 for both) that were stable over time. The factor most strongly associated with clinical severity was pH; this result was consistent across machine learning methods. In a separate multivariable logistic regression model built for inference, age (odds ratio [OR], 1.03 per year; 95% CI, 1.03-1.04), male sex (OR, 1.60; 95% CI, 1.51-1.69), liver disease (OR, 1.20; 95% CI, 1.08-1.34), dementia (OR, 1.26; 95% CI, 1.13-1.41), African American (OR, 1.12; 95% CI, 1.05-1.20) and Asian (OR, 1.33; 95% CI, 1.12-1.57) race, and obesity (OR, 1.36; 95% CI, 1.27-1.46) were independently associated with higher clinical severity. Conclusions and Relevance: This cohort study found that COVID-19 mortality decreased over time during 2020 and that patient demographic characteristics and comorbidities were associated with higher clinical severity. The machine learning models accurately predicted ultimate clinical severity using commonly collected clinical data from the first 24 hours of a hospital admission.


Assuntos
COVID-19 , Bases de Dados Factuais , Previsões , Hospitalização , Modelos Biológicos , Índice de Gravidade de Doença , Adulto , Idoso , Idoso de 80 Anos ou mais , COVID-19/etnologia , COVID-19/mortalidade , Comorbidade , Etnicidade , Oxigenação por Membrana Extracorpórea , Feminino , Humanos , Concentração de Íons de Hidrogênio , Masculino , Pessoa de Meia-Idade , Pandemias , Respiração Artificial , Estudos Retrospectivos , Fatores de Risco , SARS-CoV-2 , Estados Unidos , Adulto Jovem
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA